A Novel MST based Multi-Prototype Clustering Algorithm
نویسنده
چکیده
Clustering is the process of partitioning the data objects such that the degree of similarity among the objects in the same partition is more than the degree of dissimilarity between the objects of different classes. There are many applications of clustering like medicine, marketing, insurance, image analysis etc. The squared-error clustering algorithm uses single prototype to represent a cluster. However, the algorithm is incapable of detecting clusters with complex shapes. To resolve this issue, multi-prototype clustering algorithm has been proposed which can discover clusters of complex shapes, but it uses squared error clustering to produce prototypes. This paper proposes multi-prototype clustering algorithm based on Minimum Spanning Tree. Minimum Spanning Tree is known to be capable of detecting clusters with irregular boundaries. Two closest prototypes are merged based on Euclidean distance. At each step of merging, weighted standard deviation is calculated. The result is output for the set of clusters for which the weighted standard deviation is least. Experimental results for real data demonstrate the effectiveness of the proposed algorithm.
منابع مشابه
Multi-layer Clustering Topology Design in Densely Deployed Wireless Sensor Network using Evolutionary Algorithms
Due to the resource constraint and dynamic parameters, reducing energy consumption became the most important issues of wireless sensor networks topology design. All proposed hierarchy methods cluster a WSN in different cluster layers in one step of evolutionary algorithm usage with complicated parameters which may lead to reducing efficiency and performance. In fact, in WSNs topology, increasin...
متن کاملA Multi-Objective Approach to Fuzzy Clustering using ITLBO Algorithm
Data clustering is one of the most important areas of research in data mining and knowledge discovery. Recent research in this area has shown that the best clustering results can be achieved using multi-objective methods. In other words, assuming more than one criterion as objective functions for clustering data can measurably increase the quality of clustering. In this study, a model with two ...
متن کاملNGTSOM: A Novel Data Clustering Algorithm Based on Game Theoretic and Self- Organizing Map
Identifying clusters is an important aspect of data analysis. This paper proposes a noveldata clustering algorithm to increase the clustering accuracy. A novel game theoretic self-organizingmap (NGTSOM ) and neural gas (NG) are used in combination with Competitive Hebbian Learning(CHL) to improve the quality of the map and provide a better vector quantization (VQ) for clusteringdata. Different ...
متن کاملMinimum spanning trees for gene expression data clustering.
This paper describes a new framework for microarray gene-expression data clustering. The foundation of this framework is a minimum spanning tree (MST) representation of a set of multi-dimensional gene expression data. A key property of this representation is that each cluster of the expression data corresponds to one subtree of the MST, which rigorously converts a multi-dimensional clustering p...
متن کاملClustering gene expression data using a graph-theoretic approach: an application of minimum spanning trees
MOTIVATION Gene expression data clustering provides a powerful tool for studying functional relationships of genes in a biological process. Identifying correlated expression patterns of genes represents the basic challenge in this clustering problem. RESULTS This paper describes a new framework for representing a set of multi-dimensional gene expression data as a Minimum Spanning Tree (MST), ...
متن کامل